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SEPARABILITY OF AGRICULTURAL COVER TYPES 


IN SPECTRAL CHANNELS AND WAVELENGTH REGIONS 
R. Kumar 
ABSTRACT 

The purpose of this study was to evaluate the spectral 
djannels as well as wavelength regions - visible, near infrared, 
middle infrared and thermal infrared - with respect to their estimated 
probability of correct classi ■‘"ication (P^) in discriminating 
agricultural cover types. Mul tispectral scanner data in twelve spectral 
channels in the wavelength range of 0.4 to 1 1 . 7 vim acquired in the 
middle of July for three flightlines wore analysed by applying 
automatic pattern recogni tion tochniquer. . The same analysis was 
performed for the data acquired in the middle of August, 1971, over the 
same three flightlines, to investigate the effect of time on the 
results. The effect of deletion of each spectral channel as well as 
each wavelength region on P^ is given. Values of P^ for all possible 
combinations of wavelength j'egions in the subsets of one to twelve 
spectral channels are also given. The over-all values of P^ v/ere found 
to be greater for the data of the middle of August than the data of 
the middle of July. 


INTRODUCTION 

The purpose of this study was to determine lIio 
statistical separability of mul ti spectral measurements from 
agricultural cover types for evaluation of spectral channels as well as 


'Hrii.' u'ork icais ruppoj'tea hy the Jni'tituto ch Vei'quii'>as hepadaii' 
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wavelength regi ons -- visible, near infrared, middle infrared and 
thermal infrared. The data were arialysea in subsets of one to tv/elve 
spectral channels, in the wavelength range 0.46 to 11.7 pm for 
selected flightlines of the 1971 Corn Blight Watch Cxperimetit^ . The 
agricultural cover types selected were: corn, soybeans, green forage 
(hay & pasture), and forest. In particular, the objectives of the 
study wore: (1) to study the effect of deletion of each of the twelve 
spectral channels as well as each of the wavelength regions (visible, 
near infrared, middle infrared, and thermal inf. ared), on the 
statistical separability and correspondi ig estimated probability of 
correct classification of the agricultural cover types. (2) To 
develop a criterion for a combination of wavelength regions, based 
on the estimation of its probability of correct classification of 
agricultural cover typos. Based on this criterion, evaluate all 
possible combinations of wavelength regions in tiie subsets of one to 
twelve spectral channels out of the twelve available ones. (3) To 
investigate the effect of time on these results. 


LITERATURE REVIEW 
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Kumar (1972) has done a thot-ough review of the general 
area of 'reflection and emission from plants'. The >'esults of percent 
correct classification, obtained from the analysis of mul ti spectral 
scanner (MSS) data by applying pattern recognition techniques, for a 
flight line divided into four classes (soybeans, corn, water and a 
mixture of stubble, diverted acres and pasture) were repo»'ted in the 
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LARS annual report (1970) . These are summarized as follows: 


Performance Percent Correct Classification Using 
the Following Wavelength Bands 

3 Visible 3 Visible 3 Visible 

1 Reflective IR 2 Reflective IR 2 Reflective IR 

1 Thermal IK 


Training fields 
Test fields 
Training field 


86.8 
82 . 8 

= 4303 sample points 


91.9 93.6 

83.9 8o.4 

Test fields = 7135 sample points. 
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where 

channel 1 = 0.40 - 0.44 ym, channel 2 = O.j/ - 0.58 yin, 

channel 3 = 0.66 - 0.72 yni, channel 4 = 0.30 - 1.00 ym, 

channel 5 = 1.50 - 1.80 ym, channel 6 = 8.00 -14.0 ym. 
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Coggeshall and Hoffer (1^73) have analyzed the 
mul ti spectral scanner data of a flightline having mostly fores^. 

They have investigated the following in much detail; 1) determination 
of the optimum number of the 12 available mul tispectral scanner (MSS) 
wavelength bands to use for forest cover mapping with automatic data 
processing (ADP) techniques; 2) determination of the current capability 
to map basic forest cover types using MSS data and ADP techniques; and 
3) determination of the relative utility, to forest cover mapping, of 
the four spectral regions available in the twelve channel MSS data 
(i.e., visible, and near, middle and thermal infrared). 

They concluded from the tests of classification accuracy 
of six cover types of interest (deciduous forest, coniferous forest, 
v;ater, forage, corn and soybeans), that the use of five v/avelength 
bands would fulfill the dual requirements of adequate accuracy and 
moderate computer time. Their results also indicated that the thermal 
infrared wawlength region is desirable, but not necessary, for forest 
cover mapping, and that accurate classification of deciduous and 
coniferous forest cover can be achieved with the visible plu: either 
the near or middle infrared spectral regions. Hov;ever, the deletion of 
the thermal infrared region caused considerable confusion among the 
agricultural cover types. 
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Kumar and Silva (1974) have investigated the statistical 
separability of the spectral classes of blighted corn in much detail, 
data quantity (168 fields having 18304 sample points in ten flightlines) 
and depth. They found that the greater tlie difference between the 
blight levels, the more statistically separable they usually were. In 
addition, they found that the spectral classes of corn (healthy and 
blighted) were most separable in the wavelength range 1.00 to 1.40 ym. 
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fAQC SlAtSIK NOT 

Bauer (1974)^ has discussed tlie results of 'wavelength band selection', 
obtained in the Corn Blight Watch Experiment. 

Kumar atid Silva analyse.;! the nul t i spec.t ral 
scanner data in the wavelength range 0,4 to 11.7 iim for three 
flightlines. They found that in the subsets of one to six spectral 
channels, the combination of wavelength regions (where V, N, M and T 
denote the visible, near infrared, middle infrared and thermal infrared 
wavelength regions, respectively): V, VM, VNM, VNMT, VVNMT, VVNMMl , 
respectively, were found to be the best choices for getting good 
overall statistical separability of the agricultural cover types for 
the data acquired on July 16 as well as August 12. An effort was made 
to explain these results on the basis of spertral properties of 
agricultural cover types. The overall statistical separability of the 
agricultural cover types was found to be greater for the data of 
August 12 than the data of July 16. The author felt a definite need 
for a further analysis of similar nature to evaluate explicitly each 
spectral channel, each wavelength region and all possible combinations 
of wavelength regions, for statistical separability and the 
corresponding probability of correct classification. 

METHOD OF ANALYSIS 

Mul ti spectral scanner data in twelve spectral channels 
in the wavelength range 0.4 to 11,7 pm, collected with an optical- 
mechanical scanner at altitudes of 900 to 2100 meters (3000 to 7000 
feet) over Western Indiana were analyzed by applying automatic pattern 
recognition techniques. Ihe wavelength bands of these twelve spectral 
channels are given in Table I. The data of three selected flightlines, 
acquired in the middle of July of 1971, were analysed. Each of these 
three flightlines had fair or good amounts of each of the four 
agricultural cover types: corn, soybeans, green forage and forest. 

These three flightlines were selected carefully so that these combined 
could be considered to be representati ve of the four agricultural cover 
types in Western Indiana. 
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Black and white photography and gray-scale printouts of 
the spectral channels of the flightlines were used to aid in locating 
the boundaries of the fields on the LARS (Laboratory for Applications 
of Remote Sensing, Purdue University) Digital Display*. Sufficient 
number of fields of each agri cul tural cover type were selected careful ly so 
that they could be assumed to be representati ve of the flight line. 

Using the same three flightlines and twelve spectral 
channels, an i den ti ca l analysis was performed on the data acquired in 
the middle of August, 1971, to study the effect of time on the 
statistical separability of agricultural cover types. The mul tispectral 
scanner data was acquired on both dates (middle of July and middle of 
August) between 10:30 a.m. and 12:05 p.m. (local solar time). In 
addition, these data were of good quality and free from problems like 
lack of sufficient ground observations , excessive cloud cover, etc. 

The analysis was done for the data acquired in the middle of July and 
the middle of August, because corn and soybeans have reached their 
maximum vegetative growth by these times, and one month of time is 
sufficient for significant changes to occur in the spectral properties 
of agricultural cover types. The author wanted to avoid the analysis 
of data taken from late September on, because soybeans are harvested 
in September-Oc tober. The author tried to keep all the variables, 
other than time, uniform in the two (middle of July and middle of 
August) sets of data. For example, an effort was made to select the 
same field boundaries for the two sets of data. A total of more than 
600 fields taken from three fliglitlines were analysed. 

Each fie'.l was treated as an indepctident unit and the 
fields of the same agricultural cover type were put in the same class. 


* Tiw L/U\S Digital Dir.play ir, a .'p^-i-ial Ly daiupned image display 
system Linked to an IBL! ZBO Model G7 usina x ^tatnode ray tul’t- as. 
the pictorial mediivn for gr>ay Si-alc multi spsceral imagery. 
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The sample points within each field were highly correlated. The 
LARSYS** statistics algorithm was used to compute the mean vector 
and covariance matrix (mean and standard deviation) of the classes. 

In the LARSYS statistics algorithm, cluster algorithm, and feature 
selection algorithm, each sample point is treated independently it' 
order to make the system convenient and flexible for usage. A key 
assumption made in these algorithms is that the distributions of the 
classes are Gaussian. Histograms of the spectral classes defined above 
were used to check unimodality of the statistical distributions in 
individual channels. The classes were redefined to eliminate distinct 
multiple modes. Divergence is defined for any two density functions. 

In the case of nomial variables with unequal covariance matrices, 

9 

divergence in n spectral channels is given by 

D(iJlC,,C2,...C,P = 1/2 t. {(!;. - - £■'))+ '/2 + sT' ) 

(U. - up (Uj - up^) (1) 


where 

U and T. represent the mean vector and covariance matrix respectively: 
tr A (trace A) is the sum of the diagonal elements of A. 

A modified form of the divergence D--, referred to as 

8 10 ' 

"transformed divergence", has a behavior ’ more like the probability 
of correct classification than the divergence, D. 

= 2(1 - exp(-D/8)} (2) 

Transformed divergence has been used throughout this study. 

Altliough divergence only provides a measu»'e of the 
distance between two class densities, its use has been extended to the 


** LMtISYH ir> t’r.c tPCiVth datu p/M*. rod:Ju\' ' 

the Lahovei^evu for Appi' ieoi ioKC of lu r 'te :\ > c;une, P:o\iu ' v, r.\' t 'j , 
W. LofayettCj Indiar.a. 
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multiclass case hy taking the average over all pairs^^. Let D^.| denote 
the divergence between classes i and j of a certain flightline, then 
the average divergence over all class pairs of four* classes (each 
agricultural cover was treated as a separate class) is given by 

[) = — 

‘tAVLi 0 

L i 


TIP 


I 1 a 


^114 * ^ 


^lc4 ^ ^Va4 


(3) 


let •• mini mum of UV].* i i* ^ 


TIJ 


n -, , l/y It -I- > > , » 1 * T i * t ** ) 

a 114, 1 1.0 1 ,4 1 o‘< ' ' 


Swain*" ^147v) lias pointed out that one possible strategy 
is to select the subset of features for winch the average transformed 
divergence, maximum. While this strategy is certainl> reason- 

able, there is no guarantee that it is optimal. Another strategy is to 
nnvimice the minimum divergence, i.e., to select the featun- 

combination which provides the greatest separation between the hardest- 
to-separate pair of classes. 


Let superscripts 1 and with the symbol "Dy" denote the 
values of transformed divergence for the data acquired in miudle of 
duly and middle of August respect i \ely . Let 

dI,,,,,., be the values of '•'4* 1*0) >•' fit'st, second and 

I N 1 II 0 •i’ll I ’ 

third flightline respectively for the uata acquired in middle of July. 


^''tAVC. "T ^VaVlII ^ ^^lAU'.d ^ ^TAVGJ ' ^ 


v3) 


" nn 111 mum of [Gy.uf^l. ^ ~ O') 


The LAKSYS feature selection processor was used to find 

nj,,, , and p!,.,,, in all possible ». ombinations of one to twelve s[iectral 
lAVii IMIrl — 

channels out of the available twelve. 
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Throughout this analysis, the combinations of one 
through twelve spectral channels wore ranked so as to get the 
descending order of '^or the data of middle of July and middle 

of August respectively. In other words 

^Tl^lAX spectral channels} = max (7) 

maximized over all possible subsets of r spectral channels out of tiie 
available twelve for data of middle of July, from the values of the 
average transformed divet'gence, classi f icatiof) accuracy can be 
reasonably predicted f>'om the results of Swair et.al. ( 1973)^*^. 

Table I gives the \;avelength interval and the 
corresponding wavelength region of each, of the twelve spectral channels. 
Tables II and III give, respecti '.ely , the effect of deletion of each 
of the twelve spectral cliannels in each of the four wavelength regions 
on as well as 

To fulfill otic of the main objectives of the study -- 
evaluation of all possible combinations of wavelength regions in the 
subsets of one to twelve spectral cliai.nels -- the following criterion 
is proposed: 

Each of 12 available cliannels of the mul t i spectral scanner can be 
placed in one of the four wavelength regions -- visible, near 
infrared, middle infrared and thermal infrared, as shown in 
Table I. Thus, any combination of spectral channels can ht? cahed 
ub the correspondi tig combination of the waveletigth regions, lor 
example, channel combination 1,8,10 and 12 is called "combination 
of visible, neat* infrared, middle infrared and thermal infrared 
waveletigth regions", and is denoted by V 11 M T. For a given 
combination of wavelength regions, for example V N M T; 
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^TAVG corresponding values of P^, using the curve of 

Swain et. were calculated for all possible combinations 

of four spectral cnannels out of twelve available ones that 
constitute the combination V N M T. The mean of these values of 
was calculated for all possible combinations of wavelength 
regions in the subsets of one to twelve spectrcil channels, at»d 
is shown in Table IV for the data of middle of July as well as 
middle of August. 


RtSLILlS A:in DISCUSSION 

The overall separability of green forage from the other 
agricultural cover types was found to be »..ortsiderably lower than the 
correspoiuiing separability of co»'ti, soybeans and forest, because the 
standard deviation of the mean response of green forage was largest 
anxMig the agi'icul tural cover types, This is because there was much 
natural variability in the spectral characteristics of hay as well as 
pasture. The overall separability of forest from othei' agricultural 
cover types was found to be considerably higher than the corresponding 
separability of corn, soybeans and green forage. Co'een forage and corn 
were hard to separate, because of considerable overlap in the values 
of their mean response, due to tiie large standard deviatioii of green 
forage. In addition, it was harder to separate corn from soybeans for 
the data of middle of July than for the data of miudle of August. 

Table II shows tliat for greatest overall statistical 
separabi 1 i ty , channel 7(0.61 to0.70 urn, red channel) seems to be the 
best channel. It should be pointed out that the predominant pigments 
of the plant leaf absorb ifi the vicinity of O.-M ,im, but only 
chlorophyll absorbs in the red, in the vicinity of 0.64 um. The 
reason for channel 7 boitig the best channel may I'O Uiat tlicre are 
significant differences in the chloropliyll content of diffet'ent 
agricultural cover typos, which give r-ise to difference' in their mean 
response in channel 7, and iience a relatively large val ’e of average 
transformed divergence between them. An additional reaS' n may be that 
the red wavelength I'egion is extremely favorabli for gualitative and 
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quantitative description of soils 

Table II also shows that deletion of chajinel 7 reduces 
P^. by about two percent for the data of middle of July as well as 
middle of Auqust. The deletion of each of the other channels causes 
no reduction, or less reduction in the values of P., as Lom[)ared to 
channel 7. Thus, ocletion of any one of the twelve cnannels does not 
cause any substantial decrease in values ol in the subsets or one 
to eleven channels. 

As one would expect. Table III shows tiia. the greatest 
separability of the agricul tuf\tl cover types is otifained by using all 
of the twelve channels, liowevc**, an increase i!i the number of chwOinels 
used in a classification algorithm requitvs a di spro[)ort ioiiate 
rncrease in computer time . Wihout doing a detailed analysis, it 
seems from Table III that tl'.c subset of five chatinels is likel. to 
fulfill the dual jvquirements of adequate classi f iv at ion accuracy aiu: 
nK>derate computer tiiu*. However, this cof'clusien is very 'M-el iiinna'*;. 
because no cost eenefit ana I> sis for the data was done. 

Table III shows that deletion of eacn of the w.;vo 1 '.‘noth 
regions causes the following r.uxiaium reductions in p fc>r the dat.i 
acquired in the middle of Jul\ : visibli' subset of four ciuinncls), 

near infrared (O.oK, subset of ten chantielsl, luidulo infraivd (1.10, 
subset of two channels), ther-mal infrarovt (1.0.', >ubset of four 
char'.nels). The correspond irig values of reducfiotis in P^ for the data 
acquired in the middle of Augu'st are: visible ^0.07, subset of o'u* 
channel), near infrared (0.41, subset of ten channels), middle 
infra»*ed (l.lo, subset of two channels), thermal infrared ^0.3'\ 
subset of eight chamiel s). Thus , it appears that deletion of tne visible 
wavelength region causes more reduction in as compared to any ot 
the other wavelength regions. Tlie deletion of near infrareo wavelength 
regioti apparently causes relatively small changes in the valut's of P.. 


Table IV shows that in the sul'sets of one to six SjCv tral 


cha.-mels, the combination of v-ave length regions V, VM» VMT, VfiMT, VVNiMT, 
WNMI'Pi are found to be the best choices for tne claua of middle of 
duly. Sl'iiilrrly, for the daro acquired in the middle of August: T, 
flT, VNT or VMT, VNi'lT, \ZVNMT or VilMiiT, VVNMMT are found to be the 
best choices. These resilts ai‘o similar to the results obtained by 
Kumar and Sil/a ai : different criterion of evaluation of 

combinations of wavei.'iigi.n regions was used by them. Obviously, all 
the seven channels in the visible wavelength region in this 
mi:l ti spectral scanner (MSS) are not necessary for getting good 
separability of the agricultural cover types. 

In the data of middle of July as v/ell as middle of August, 
VNMT is found to be the best choice in the subsets of four channels. 

It indicates that each wavelength region is valuable in its ovrn way; 
the visible wavelength region, for instance, is valuable because the 
predominant plant pigments absorb in this wavelength region. Because of 
tlie presence of water absorption bands in the middle infrared, surface 
gcometr’y of the target and the moisture content of its top layers (of 
the order of micrometers) determine its reflectance in the middle 
infrared. This region is valuable because there are significant 
differences in the geometry of tlie surface and/or moisture content 
of top layers of the agricultural cover types. The near infrared 
wavelength region is useful because substantial contrasts between the 
agricultural covers and soils occur in this r'egion. Thus, this region 
is especially useful when there arc substantial differences in the 
percentage ground covers of tlie agricultural cover types. The thermal 
channel (channel 12) contains information about the radiant temperatures 
of the targets in the wavelength region 9.3 to 11. / yin. There are found 
to be significant differences i.. the radiant temperatures of the 
agricultural cover types. The ratriont temperat ure of a plant, can be 
found by do'.q an o'^ey'oy balance on it, und it depends upon factors 
such as: ranic en incident u.; the plant, plant geometry and size, 
spectral ,'ropcrtif:s of the plant (including soil), percent ground 
cover, conv^-ction coefficient and transpi ration rate of the plant, etc". 
There are 1 tc be di f ie; in the values of the above variables 
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for different agricul tur»Tl cover types that give rise to differences 
in their iwdiant teiiH)era lures . Ihus, in the sul)set of six spectral 
channels, irrespective of what otlier five channels were used, adding 
a thermal channel usually increased the overall separability of the 
agricultural cover ty[’es. 

In conclusion, it should be said that determining which 
combinations of one, two,..., eleven specti'al channels, out of twelve 
available spectral channels, give greatest ovt'rall statistical 
separability of the agricultural cover typi'*. is a complex I'roblem, 
because statistical separabi 1 i ty , in any given combination of spectral 
cluuinels, depetuls u[ion many variables, such us quality of data in the 
spectral cfiannels, quality and quatitity tif the ground truth available, 
time of acquiring the data, human decisions (numt)er of fields, field 
boundaries to be selected, etc.) envi romiii'nta 1 variables, soil variables, 
etc, , llew('ver-, detetmii iiiiuj wliii.h combinations of one tlmnigh six 
wavelength regions give greatest overall statistical sepai\»bi 1 i ty of 
the agricultural cover typi's is a rvlatively less complex ()roblem 
(Table IV). It should be pointed out, althougli the f 1 i glit.l i nes analysed 
had considerably different character'i st ics than the flightline 

4 

analyzed l)y Coggeshall and lloffer (Introduction), Uiat many of ttic 
conclusions presented in this paper art* the same as ol)tained by them. 
This means that, although the analysis was done I'or tliree flight lines, 
tlje results obtained may V';ell be applicable to other fliglitlines liaving 
considerably different character istics than these. Ihe overall 
statistical separability of ttio agricul tural cover types was found to 
be greater for tlie d.Ua of middle of August than for the dat.i oi middle 
of July. It should be noted that many of the results oluained f)'om the 
analysis of the data of middle of July are the same as those obtained 
from tlie data of middle of August. This means that, a I thuugti tite 
analysis was done for the data acquired at two different times, llu} 
results obtained from this analysis may well be a('plical)le to the ilata 
accjuiri'd at some' other times of the year. 
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TABLE III. 


EFFECT OF OELETION OF LACH WAVELENGTH REGION ON THE 
PERCENTAGE PROBABILITY OF CORRECT CLASSIFICATION 
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